CDS

Accession Number TCMCG075C05051
gbkey CDS
Protein Id XP_017971749.1
Location complement(join(2376015..2376106,2376541..2376731,2376838..2376961,2377561..2378215))
Gene LOC18607418
GeneID 18607418
Organism Theobroma cacao

Protein

Length 353aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018116260.1
Definition PREDICTED: lysM domain-containing GPI-anchored protein 2 [Theobroma cacao]
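
The Location and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the location uses 1-based inclusive coordinates and that the CDS ends with a single stop codon; the helper name `spliced_length` is hypothetical and not part of any database tooling.

```python
import re

def spliced_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a GenBank-style location."""
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    # Coordinates are 1-based and inclusive, so a segment spans end - start + 1 bases.
    return sum(int(end) - int(start) + 1 for start, end in segments)

# Location string copied from the CDS block of this record.
location = (
    "complement(join(2376015..2376106,2376541..2376731,"
    "2376838..2376961,2377561..2378215))"
)

cds_nt = spliced_length(location)   # 92 + 191 + 124 + 655 = 1062
protein_aa = cds_nt // 3 - 1        # assumption: the last codon is a stop codon
print(cds_nt, protein_aa)           # prints "1062 353"
```

The four exons sum to 1062 nt, that is 354 codons including the stop, which agrees with the reported protein length of 353 aa.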

EGGNOG-MAPPER Annotation

COG_category S
Description lysM domain-containing GPI-anchored protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0003674
GO:0005488
GO:0005575
GO:0005623
GO:0005886
GO:0005911
GO:0008061
GO:0008144
GO:0009506
GO:0016020
GO:0030054
GO:0031224
GO:0031225
GO:0031226
GO:0044425
GO:0044459
GO:0044464
GO:0046658
GO:0055044
GO:0071944
GO:0097367

Sequence

CDS:  
ATGGGTTTTGCTTTTGCTAAGCTGTTTCTTCTTCTCCTTCCGTTGCTCTCATCTCTGACTCTAGAGCACTCCGCGGCTCAAGGCTTCAACTGTAGCTCCCCGAGATCATGTCGTGCCCTGGTCGGCTACGTCACCGTCAACAACACCGACCTCGGCACCATTCAATCTCTTTTCAACGTCAAGAACTTCCGGAGTATCCTCGGAGCTAACGGCTTATCTCTCTCTACTCCACGCACCCACAACATATCTGCACAACAAGTCATCAAAATCCCCATCAACTGCGTTTGCTACAACGACACCGGAACTTCCAGCGGCGCTCCAATCTACGAGGTGAAAGAAGGTGACTTTCTCTTCCACATAGCAGCTGAGATTTTCTCGAGGTTAGTGACGTTCCAGCAAATTACGGAAGCCAATGGGATTGGGAATTCCAGTTTGATAATGCCCGGGCAGAAGCTGAAAATCCCGTTACCGTGTAGCTGTGATGACGTGAACGGCGAGAAAGTGGTGCATTATGCACATATTGTGAAGTTAGGGAGTACCTTGGAGGGGATTGCTAGTGAGTTTGGAACTGATGAAGGGACTTTGCGTAGGGTTAATAACATCACCGCCGATAATCAGTTAATAGCTGACCAACCGATCGATGTTCCTCTCAAAGCCTGCAACTCACCAATAAGAAGTGACTCGTTGGACTTTCCTTTACTTGCTGCTAATGGAACATACGTCTTCACTGCTAATGGTTGTGTGAGGTGCACATGTGATGCTGCTGTTAACAACTCGACATTACGTTGTGAACCATCCCAGAATAAACCATCCAGGTGGGAGACGTGCCCATCTATGCAATGTGAAGCTTCAGATGGTTTATCCCTTGGCAATAGTACCACTTCTGGTTGCAATCGCACAACCTGTTCCTATGCTGGATATAACAACTCAACCATCTTCACAACCCTTGAACAGGACTCCACTTGTTCATCAACTACTCCAAGCAATGATGTTACAAGGATCAGTTTGAATTGGGACTTTCTATGCATCTTGATCTTGCTTTGCTTTCATCTCTTCCAGTAA
Protein:  
MGFAFAKLFLLLLPLLSSLTLEHSAAQGFNCSSPRSCRALVGYVTVNNTDLGTIQSLFNVKNFRSILGANGLSLSTPRTHNISAQQVIKIPINCVCYNDTGTSSGAPIYEVKEGDFLFHIAAEIFSRLVTFQQITEANGIGNSSLIMPGQKLKIPLPCSCDDVNGEKVVHYAHIVKLGSTLEGIASEFGTDEGTLRRVNNITADNQLIADQPIDVPLKACNSPIRSDSLDFPLLAANGTYVFTANGCVRCTCDAAVNNSTLRCEPSQNKPSRWETCPSMQCEASDGLSLGNSTTSGCNRTTCSYAGYNNSTIFTTLEQDSTCSSTTPSNDVTRISLNWDFLCILILLCFHLFQ
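
The relationship between the CDS and Protein sequences above can be illustrated with a short translation routine. This is a sketch only, assuming the standard genetic code applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names, and the two test strings are the first 24 and last 15 nucleotides of the CDS shown above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases are not in the table and become "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGTTTTGCTTTTGCTAAGCTG"))  # prints "MGFAFAKL"
print(translate("CATCTCTTCCAGTAA"))           # prints "HLFQ"
```

Both fragments match the start (MGFAFAKL) and end (HLFQ) of the Protein sequence in this record; passing the full CDS string to `translate` would be the way to check the entire sequence.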